Evaluation of Annotation Schemes for Japanese Discourse
Authors
Japanese Discourse Tagging Working Group
Abstract
This paper describes the standardization of discourse annotation schemes for Japanese and evaluates their reliability. We propose three schemes: utterance units, discourse segments, and discourse markers. The schemes were incrementally improved based on experimental results, and their reliability is estimated to be in the "good" range.
Similar resources
Japanese Dialogue Corpus of Multi-Level Annotation
This paper describes a Japanese dialogue corpus annotated with multi-level information built by the Japanese Discourse Research Initiative, Japanese Society for Artificial Intelligence. The annotation information consists of speech, transcription delimited by slash units, prosodic, part of speech, dialogue acts and dialogue segmentation. In the project, we used the corpus for obtaining new find...
Annotation and Analysis of Discourse Relations, Temporal Relations and Multi-Layered Situational Relations in Japanese Texts
This paper proposes a methodology for building a specialized Japanese data set for recognizing temporal relations and discourse relations. In addition to temporal and discourse relations, multilayered situational relations that distinguish generic and specific states belonging to different layers in a discourse are annotated. Our methodology has been applied to 170 text fragments taken from Wik...
An Efficient Statistical Speech Act Type Tagging System for Speech Translation Systems
This paper describes a new efficient speech act type tagging system. This system covers the tasks of (1) segmenting a turn into the optimal number of speech act units (SA units), and (2) assigning a speech act type tag (SA tag) to each SA unit. Our method is based on a theoretically clear statistical model that integrates linguistic, acoustic and situational information. We report tagging exper...
The Automatic Translation of Discourse Structures
We empirically show that there are significant differences between the discourse structure of Japanese texts and the discourse structure of their corresponding English translations. To improve translation quality, we propose a computational model for rewriting discourse structures. When we train our model on a parallel corpus of manually built Japanese and English discourse structure trees, we ...
A Discourse Structure Analysis of Technical Japanese Texts and Its Implementation on the WWW
This paper deals with a discourse structure analysis of technical Japanese texts for developing a Japanese writing Computer Assisted Language Learning (CALL) system whose goal is to assist students in learning to write technical Japanese texts. To analyze discourse structures of technical Japanese texts, cohesive expressions are used as cue words. The rules for analyzing texts are based on micr...